A Computational Platform for Development of Morphologic and Phonetic Lexica

نویسندگان

  • Matej Rojc
  • Zdravko Kacic
چکیده

Statistic approaches in speech technology, either based on statistical language models, trees, hidden Markov models or neural networks, represent the driving forces for the creation of language resources (LR), e.g. text corpora, pronunciation lexica and speech databases. This paper presents the system architecture for rapid construction of morphologic and phonetic lexica for Slovenian language. The integrated graphic user interface focuses in morphologic and phonetic aspects of the Slovenian language and allows the experts good performance in analysis time.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

COLDIC, a Lexicographic Platform for LMF compliant lexica

Despite of the importance of lexical resources for a number of NLP applications (Machine Translation, Information Extraction, Event Detection and Tracking, Question Answering, among others), there has been a traditional lack of generic tools for the creation, maintenance and management of computational lexica. The most direct obstacle for the development of such generic tools, that is, independ...

متن کامل

Lexicon and Corpora for Speech to Speech Translation (LC-STAR)

The objective of the EU-project LC-STAR (Lexica and Corpora for Speech-to-Speech Translation Components) is corpora collection and lexica creation for the purposes of Automatic Speech Recognition (ASR) and Text-to-speech (TTS) that are needed in speech-to-speech translation (SST). During the lifetime of the project (2002-2005) these lexica will be specified, built and validated. Large lexica co...

متن کامل

Petra, osiris and molinspiration: A computational bioinformatic platform for experimental in vitro antibacterial activity of annulated uracil derivatives

Annulated pyrano[2,3-d]pyrimidine/pyrano[2,3-d]uracil derivatives were synthesized using aromatic aldehydes, active methylene compounds and barbituric acid in presence of dibutylamine (DBA) catalyst in ethanol as solvent. The different substituents on phenyl ring in the fused pyrano uracil skeleton showed productive influence on its antimicrobial activity against some gram positive and gram neg...

متن کامل

Specifications of Building Polish Lexica for Application in ASR and TTS Systems

This paper brings detailed information concerning the specifications of building Polish lexica of common and special application words for use in speech applications such as ASR (automatic speech recognition) or TTS (text-to-speech) synthesis. The specifications include information on the collection of text corpora and word lists, phonetic, grammatical and morphological annotation, as well as s...

متن کامل

Automatic Phonetic Transcription by Phonological Derivation

Automatic phonetic transcription tools usually perform phonetic transcriptions directly from orthographic representations. Although these approaches often achieve good results, theoretical studies suggest that including morphophonological knowledge allows those systems to improve their performance. Following this idea, we developed a tool which first obtains an underlying representation of each...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000